Translation & Transform Algorithm of Query Sentence in Cross-Language Information Retrieval

نویسندگان

  • Xiao-fei Zhang
  • Ke-liang Zhang
  • He-yan Huang
چکیده

Based on large-scale bilingual corpora and the theories of vector space model and lexical mutual information, this paper explores the application of the traditional monolingual IR technology to converting the translation of query sentence to the computation of the boost value of query keyword translations in the bilingual dictionary, so that the target language query sentence is reconstructed. The experiment finds a 92.8% precision rate in the first 10 retrieved documents and an 88.9% precision rate in the first 100 retrieved documents.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluating Resources for Query Translation in Cross-Language Information Retrieval

Our goal is to evaluate the utility of a lexical resource containing Lexical Conceptual Structures LCS for use in cross language information retrieval Our evaluation makes use of a combination of techniques from interlingual machine translation Dorr with conventional information retrieval techniques Oard OardandDorr Given a query in one language we transform the query into the corresponding ter...

متن کامل

Sentence Alignment by Means of Cross-Language Information Retrieval

In this chapter, we focus on the specific problem of sentence alignment given two comparable corpora. This task is essential to some specific applications such as parallel corpora compilation Utiyama & Tanimura (2007) and cross-language plagiarism detection Potthast et al. (2009). We address this problem by means of a cross-language information retrieval (CLIR) system. CLIR deals with the probl...

متن کامل

A Quirk Review of Translation Models

The goal of machine translation (MT) is to use a computer system to translate a text written in a source language (e.g., Chinese) into a target language (e.g., English). In this section, we will give an overview of translation models that are widely used in state-of-the-art statistical machine translation (SMT) systems. A nice textbook that provides a comprehensive review is Koehn (2010). Altho...

متن کامل

Free Resources And Advanced Alignment For Cross-Language Text Retrieval

For the Cross-Language Text Retrieval Track in TREC 6, NMSU experimented with a new approach to deriving translation equivalents from parallel text databases, and also investigated performing automatic, dictionary-based translation of query terms by using a dictionary that could be queried remotely via the World Wide Web. The new approach to building bilingual translation lexicons involved alig...

متن کامل

The Use of NLP Techniques in CLIR

The application of nlp techniques to improve the results of information retrieval is still considered as a controversial issue, whereas nlp techniques in cross-language information retrieval (clir) are already well established, at least for the translation part. In this paper, the clir component Mpro-IR is represented which has been developed as the core module of a multilingual information sys...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006